Performance of FDM Simulation of Seismic Wave Propagation using the ppOpen-APPL/FDM Library on the Intel Xeon Phi Coprocessor

Authors

  • Futoshi Mori
  • Masaharu Matsumoto
  • Takashi Furumura
Abstract

We evaluated the performance of a parallel 3D FDM simulation of seismic wave propagation on the Intel Xeon Phi coprocessor. We confirmed that MPI/OpenMP hybrid parallel computing with hyper-threading is more efficient than pure MPI parallelism. Thread-parallel performance improved further when the original three DO loops of the major kernel routines were fused into two DO loops. The FDM simulation with two fused DO loops ran 1.7 to 5.1 times faster than the original code with three DO loops; however, no performance acceleration was achieved when the loops were fused into a single DO loop. This is probably due to restrictions of the current Fortran compiler optimizations.
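The loop fusion mentioned in the abstract can be pictured with a minimal sketch. It is written in Python for readability rather than in the Fortran of the actual library, and it assumes that "fusing three loops into two" means collapsing the two outer loops (here `k` and `j`) into one longer loop whose index is split back into `k` and `j`. The grid sizes, the function names, and the placeholder update (a simple doubling) are hypothetical and are not taken from the paper.

```python
# Hypothetical illustration of loop fusion; nothing here comes from ppOpen-APPL/FDM.
NX, NY, NZ = 4, 3, 2  # assumed small grid sizes, for demonstration only


def make_grid(fill=0.0):
    """Allocate an NZ x NY x NX nested list filled with a constant."""
    return [[[fill] * NX for _ in range(NY)] for _ in range(NZ)]


def update_three_loops(src):
    """Original shape: three nested loops over k, j, i."""
    dst = make_grid()
    for k in range(NZ):
        for j in range(NY):
            for i in range(NX):
                dst[k][j][i] = 2.0 * src[k][j][i]  # placeholder update
    return dst


def update_two_loops(src):
    """Fused shape: k and j collapsed into a single loop index kj."""
    dst = make_grid()
    for kj in range(NZ * NY):
        k, j = divmod(kj, NY)  # recover the original indices
        for i in range(NX):
            dst[k][j][i] = 2.0 * src[k][j][i]  # same placeholder update
    return dst


# Fill the source grid with distinct values so index mix-ups would be detected.
src = make_grid()
for k in range(NZ):
    for j in range(NY):
        for i in range(NX):
            src[k][j][i] = float(i + NX * (j + NY * k))

assert update_three_loops(src) == update_two_loops(src)
```

Under this assumption, the fused outer loop has `NZ * NY` iterations instead of `NZ`, which gives a thread-parallel runtime more iterations to distribute across many threads. Whether this matches the exact transformation used in the paper is not confirmed by the abstract.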


Similar resources

High Order Seismic Simulations on the Intel Xeon Phi Processor (Knights Landing)

We present a holistic optimization of the ADER-DG finite element software SeisSol targeting the Intel® Xeon Phi™ x200 processor, codenamed Knights Landing (KNL). SeisSol is a multi-physics software package performing earthquake simulations by coupling seismic wave propagation and the rupture process. The code was shown to scale beyond 1.5 million cores and achieved petascale performance when...


A Technology of 3D Elastic Wave Propagation Simulation Using Hybrid Supercomputers

We present a technology of 3D seismic field simulation for high-performance computing systems with GPUs or Intel Xeon Phi coprocessors. This technology covers adaptation of a mathematical modeling method and development of a parallel algorithm. We describe the parallel implementation designed for simulation based on staggered grids and a 3D domain decomposition method. We study the parallel alg...


Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor

The introduction of Intel® Xeon Phi™ coprocessors opened up new possibilities in the development of highly parallel applications. The familiarity and flexibility of the architecture, together with compiler support integrated into the Intel C++ Composer XE, allow developers to use familiar programming paradigms and techniques, which are usually not suitable for other accelerated systems. It ...


coprocessors with a basic N-body simulation

Intel® Xeon Phi™ coprocessors are capable of delivering more performance and better energy efficiency than Intel® Xeon® processors for certain parallel applications. In this paper, we investigate the porting and optimization of a test problem for the Intel Xeon Phi coprocessor. The test problem is a basic N-body simulation, which is the foundation of a number of applications in compu...


Evaluation of DGEMM Implementation on Intel Xeon Phi Coprocessor

In this paper we present a detailed study of implementing double-precision matrix-matrix multiplication (DGEMM) on the Intel Xeon Phi Coprocessor. We discuss a DGEMM algorithm implementation running "natively" on the coprocessor, minimizing communication with the host CPU. We run DGEMM natively across a range of matrix sizes, as well as using the Intel Math Kernel Library. Our optimiza...




Publication date: 2014